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What is claimed is: 

A computer-assisted method for evaluating a cluster assignment for an observation, 
comprising the activities of: 

for each of a plurality of observations, obtaining a data set containing no 
mo&than one proxy value for each of a plurality of variables, each variable having 
a plurality of possible values, the data set also containing a cluster assignment for 
the obsei\ation, the cluster assignment identifying one cluster from a plurality of 

"•sssr 

(33 clusters; 

*g for each observation from the plurality of observations, calculating a percent 

]i? of proxy valuesVor the plurality of variables that equals a mode of that observation's 

W corresponding cluster's proxy values for the corresponding variables; and 

p outputting tftf percent for each observation. 

5 
u 

M 2. A computer- assisted method for evaluating a cluster assignment for an observation, 

rf comprising the activities < 

for each of a plurality of observations, obtaining a data set containing no 
more than one proxy value fo\ each of a plurality of variables, each variable having 
a plurality of possible values, tn% data set also containing a cluster assignment for 
the observation; 

for each observation from tfte plurality of observations, estimating a 
purposeful probability that a particular possible value from the plurality of possible 
values for a particular variable will be purposefully provided by observations 
assigned to a particular cluster from a plurality of clusters; and 
outputting each purposeful probability. 

3. The method of claim 89, further comprising the activities of: 



• 
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\ for each observation from the plurality of observations in each cluster from 
th\ plurality of clusters, calculating a serendipity probability for each possible value, 
theVerendipity probability a measure of a probability that an observation in a 
particular cluster will be randomly associated with any one of the plurality of 
possible values for a particular variable; 

flar each observation from the plurality of observations, calculating a ratio of 
the purposeful probability to the serendipity probability; 

for ehch observation from the plurality of observations, calculating a 
logarithm of me ratio to obtain composition analysis score; and 

outputtmg the composition analysis scores for each observation in each 
cluster. \ 

4. The method of claim 89, further comprising the activities of: 

for each observation from the plurality of observations, assuming that before 
the observation can bdmade, the observation has an equal probability of being in 
any identified cluster from the plurality of clusters; 

for each observation from the plurality of observations, assuming that the 
purposeful probabilities are true; 

for each observatiomfrom the plurality of observations, using Bayes' 
Theorem to calculate a Bayesmrobability that a particular observation can be in each 
cluster conditional upon the observation's proxy value to each variable; 



5. A computer-readable medium containing instructions for activities comprising: 
for each of a plurality of observations, obtaining a data set containing no 
more than one proxy value for each Af a plurality of variables, each variable having 
a plurality of possible values, the dataYset also containing a cluster assignment for 



outputting the Bayes probability that each observation can be in each cluster. 
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the observation, the cluster assignment identifying one cluster from a plurality of 
clusters; 

\ for each observation from the plurality of observations, calculating a percent 
of proxy values for the plurality of variables that equals a mode of that observation's 
corresponding cluster's proxy values for the corresponding variables; and 



6. An apparatu\ for evaluating a cluster assignment for an observation, comprising: 
for each of a plurality of observations, means for obtaining a data set 
containing no nrore than one proxy value for each of a plurality of variables, each 
variable having a plurality of possible values, the data set also containing a cluster 
assignment for the observation, the cluster assignment identifying one cluster from a 
plurality of clusters; \ 

for each observation from the plurality of observations, means for 
calculating a percent of proxy values for the plurality of variables that equals a 
mode of that observation' ^corresponding cluster's proxy values for the 
corresponding variables; ana 



7. A computer-readable medium containing instructions for activities comprising: 
for each of a plurality of observations, obtaining a data set containing no 
more than one proxy value for each hf a plurality of variables, each variable having 
a plurality of possible values, the data\et also containing a cluster assignment for 
the observation; \ 

for each observation from the plurality of observations, estimating a 
purposeful probability that a particular posYible value from the plurality of possible 
values for a particular variable will be purposefully provided by observations 
assigned to a particular cluster from a plurality of clusters; and „ 



itputting the percent for each observation. 



means for outputting the percent for each observation. 
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outpVtting each purposeful probability. 

An apparatus for evaluating a cluster assignment for an observation, comprising: 

foreach of a plurality of observations, means for obtaining a data set 
containing nfo more than one proxy value for each of a plurality of variables, each 
variable having, a plurality of possible values, the data set also containing a cluster 
assignment for tfte observation; 

for each observation from the plurality of observations, means for estimating 
a purposeful probability that a particular possible value from the plurality of 
possible values for a particular variable will be purposefully provided by 
observations assigned toV particular cluster from a plurality of clusters; and 
means for outputtimg each purposeful probability. 



